Joint Regression and Association Models for Repeated Categorical Responses
نویسندگان
چکیده
The focus of this study is on statistical analysis of categorical responses, where the response values are dependent of each other. The most typical example of this kind of dependence is when repeated responses have been obtained from the same study unit. For example, in Paper I, the response of interest is the pneumococcal nasopharengyal carriage (yes/no) on 329 children. For each child, the carriage is measured nine times during the first 18 months of life, and thus repeated respones on each child cannot be assumed independent of each other. In the case of the above example, the interest typically lies in the carriage prevalence, and whether different risk factors affect the prevalence. Regression analysis is the established method for studying the effects of risk factors. In order to make correct inferences from the regression model, the associations between repeated responses need to be taken into account. The analysis of repeated categorical responses typically focus on regression modelling. However, further insights can also be gained by investigating the structure of the association. The central theme in this study is on the development of joint regression and association models. The analysis of repeated, or otherwise clustered, categorical responses is computationally difficult. Likelihood-based inference is often feasible only when the number of repeated responses for each study unit is small. In Paper IV, an algorithm is presented, which substantially facilitates maximum likelihood fitting, especially when the number of repeated responses increase. In addition, a notable result arising from this work is the freely available software for likelihood-based estimation of clustered categorical responses. Jukka Jokinen, Joint regression and association models for repeated categorical responses Kansanterveyslaitoksen julkaisuja, A21/2006, 35 sivua ISBN 951-740-677-0; 951-740-678-9 (pdf-versio) ISSN 0359-3584; 1458-6290 (pdf-versio) http://www.ktl.fi/portal/4043
منابع مشابه
Fast estimation algorithm for likelihood-based analysis of repeated categorical responses
Likelihood-based marginal regression modelling for repeated, or otherwise clustered, categorical responses is computationally demanding. This is because the number of measures needed to describe the associations within a cluster increase geometrically with increasing cluster size. The proposed estimation methods typically describe the associations using odds ratios, which result in computationa...
متن کاملLog-mean linear regression models for binary responses with an application to multimorbidity
In regression models for categorical data a linear model is typically related to the response variables via a transformation of probabilities called the link function. We introduce an approach based on two link functions for binary data named log-mean (LM) and log-mean linear (LML), respectively. The choice of the link function plays a key role for the interpretation of the model, and our appro...
متن کاملRepeated ordinal measurements: a generalised estimating equation approach
Cumulative logit and related regression models for ordered categorical data may be expressed as generalised linear models for correlated binary responses. These may be fitted using the generalised estimated equation approach of Liang and Zeger (1986) and yields nearly identical results to maximum likelihood while offering further flexibility. The approach also generalises to deal with repeated ...
متن کاملPartial Association Components in Multi-way Contingency Tables and Their Statistiical Analysis
In analyses of contingency tables made up of categorical variables, the study of relationship between the variables is usually the major objective. So far, many association measures and association models have been used to measure the association structure present in the table. Although the association measures merely determine the degree of strength of association between the study varia...
متن کاملNonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys
In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian, joint modeling approach to multiple imputation fo...
متن کامل